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Abstract 

Optimizing activation and deactivation ol ba« station transmissions provide* an instrument lor 
impiovmg energy efficiency in cellular networks. In tins paper, we study live problem of (vitnrnung 
cell clustering and setting the activation lime ot each cluster, with the objective of minimizing the sum 
energy, subject to a lime constraint ot serving the users' irattic demand. Our optimization framework 
account* lor inter cell interference. and. thus, the useis* achievable rales depend on cluster I ormaimn 
We pmv ide mathematical fotmulafions and analysis, and prove the problem's NT lurdness. For problem 
solution, we lirsl apply an optimization mctlwid llul successively augments the set of variables unifcri 
consideration, with the capability ol approaching global optimum. Then. we derive a second volution 
algotlthm to deal with the the trade-oil between optimality and the combinatorial nature nl cluster 
lonnation. Numerical results demonstrate lliat our solutions achieve more than 40** energy saving over 
existing schemes, and that the solutions we obtain am within a few percent of deviation from global 
optimum. 

Index Terms 

cell activation, cell clustering, energy minimization, load coupling, column generation. 

I. Introduction 


Energy efficiency has become a major concern for cellular networks due to the explosive 
growth of data traffic. Among Ihc system elements, base stations (BSs) account far more than 


80% of the total energy consumption QU- calling lor new approaches for BS operation. To this 
end. one solution is to coordinate and optimize the activities of BSs. and the paradigm of BSs 
operation has been shifted from “always on'* to “always available* |2|. Some underutilized BSs 
with low traffic can be turned ofl. for example, to reduce the energy consumption, if the data 
traitic of the BSs can be offloaded to other BSs. Another related scheme tor energy saving is to 
organize the BSs by clusters such that one cluster is active at a time. The cells within a cluster 
arc in transmission if and only if the cluster is active. In this paper, we optimize cell cluster 
formation and the activation time duration of each cluster, with energy as the performance metric. 

A. Related Works 

There arc a number of studies that consider energy saving by deactivating BSs iBI-Pl- In these 
works, the periodic nature of cell’s traffic, both temporally and spatially, is exploited. Energy 
consumption is reduced by deactivating some BSs when the traflic demand is low. If a BS is 
deactivated, its service coverage is taken care of by other neighboring BSs that remain active. 
Coordinated Multi-Point (CoMPi transmission can be applied, sec c.e.. 0. to avoid coverage 
holes. 

Energy saving can also be gained by deactivating BSs* power amplifiers (PAs) if the amount 
of tmfhc docs not require fully continuous transmission. In the transmission mode, the PAs are 
accounted for most of the energy consumption. Typically. 50-80% of the total energy of a BS 
is consumed by the PAs QJ. For long term evolution (LTE) systems, deactivating the PAs can 
he done hy adopting discontinuous transmission (DTX> at the BSs. implemented by the use of 
Almost Blank Subframe tABS> |T7J. In QQ, performance evaluation of DTX is earned out for a 
realistic traffic scenario. 

In BS scheduling, the BSs arc grouped into clusters that potentially can overlap, such that one 
cluster is active (Lc., used for transmission) at a time, and a schedule is designed to optimize 
the use of clusters to serve the user demand with minimum energy. In |£IJ. the author* assessed 
the performance of coordinated scheduling of BS activation. In this ease. inicr-BS coordination 
is earned out for groups of three cells, with pre-defined and fixed deactivation period of each 
BS. In |9J. the authors propused a coordinated activation scheme, in which the BSs arc split 
into multiple BS groups. For each group, the BSs switch t*Mwccn activation and deactivation 
according to a pre-dehned pattern. Simulation results in J9] show that the .scheme leads to 40% 


less energy consumption. In flO] . the authors considered lour BS deactivation patterns, to allow 
for progressively deactivating BSs to improve energy efficiency, while maintaining the quality of 
service (QoS). Energy saving is achieved by dynamically selecting the four patterns adaptively 
depending on the traffic demand. 

Another related topic is transmission scheduling in wireless ad hoc and mesh networks (see. 
l ira Ifni , and the references therein). The task is to organize links into groups, and 
determine the number of time slots assigned to each group, in order to meet the demand with 
minimum time la.k.a. minimum-length scheduling). A subset of links can form a group if and 
only if tiic signal-to-intcrference-and-noisc ratio (SINK) at the receivers meets a given threshold. 
A problem generalization to continuous rales is studied in fO| . In 1131 . the authors studied 
transmission scheduling in mesh networks with a performance metric that weights together time 
and energy. 

B. Our Work 

Most of the previous works for coordinated BS activation focus on saving energy enabled 
by scenarios with relatively low f user demand. For the more general scenario with no specific 
assumption on user demand level, energy-optimal BS scheduling for delivering the demand within 
a strict time limit is challenging, due to the fact that the achievable transmission rates within 
each cell arr constrained by the inter-cell interference. For LTE networks, the transmission rates 
<i.c.. demand delivered per time unit) in different cells are inherently coupled with each other 
due to mutual interference, 'lo characterize the achievable rates, we adopt the coupling model in 
G3-II8I lor cell load-dependent SINK. Here, cell load refers to the utilization level of the time- 
spectrum resource units tRlIs) in orthogonal frequency division multiple access (OFDMA). The 
cell load levels arc coupled, i.c.. they influence each other. Namely, because the load reflects the 
amount of use of RUs for transmission, the inter-cell interference generated by a cell to another 
cell depends on the load of the former, and the interference, in its turn, has impact on the load 
level ot the latter. In the load-coupling model, the dependency relation of the cell load levels 
is taken into account in the SINK computation. To the best of aur knowledge, eneqjy-clfiricnt 
BS clustering and scheduling, subject to maximum delay and rate characterization bused on the 
coupling relation among cells, has not been investigated in the literature. 

In this pjper. we formulate, analyze, and solve encrgy-cflicicnt cell clustering and scheduling 


(CCS), where ihc cells are required to serve a target amount of data for the user* within a time 
limit to maintain an appropnate level of QoS. while considering the coupling relation among 
cells due to interference. Each cluster is a subset of cells that are in simultaneous transmission 
mode, when the cluster is active. Instead of prc-dctaned clusters, in CCS cell clustering as well 
as cluster activation times arc optimized. Within a cell, the achievable rate vectors for the cell's 
users, taking into account inter-cell interference, is not unique but form a rate region. Thus 
solving CCS also involves the selection of rate vectors. 

We present the following contributions. First, we formulate CCS and prove its NP-hardness. 
A problem is called non-dctcrmmistic polynomial-time hard, or NP-hard in short, if it is at 
least as difficult as a large class of computational problems referred to as NP. and. thus far. no 
polynomial-time algorithms exist for NP-hard problems. As the next contribution, we present 
and prove a theoretical result to enable to coniine the consideration of rate vectors to a finite set 
without loss of optimality. On the algorithmic side, we show how column generation iflD . 11911 . 
1201 facilitates problem solving, and thereby derive an algorithm for optimal cell clustering and 
scheduling (AOCCS) to approach the globjl optimum. Column generation is an optimization 
method, in which a mathematical model is successively expanded with new variables, such 
that the objective function gets improved after each expansion, until the global optimum is 
reached. By our complexity results of computational intractability, for large networks solving 
CCS optimally is challenging. We then introduce our notion of locally enumerating interference, 
that is. for each BS. the rate evaluation of its users considers a selected small set of nearby BSs 
as sources of interference, utilizing the fact that interference from distant BSs is insignificant. 
Using this notion, wc present a loeal-enumeration-hased bounding scheme iLLBS.l, providing 
lower and upper hounds on the global optimum of minimum energy, as well as enabling to deal 
with the trade-off between optimality and the combinatorial nature of cluster formation. The 
bounds, in turn, sene the purpose of gauging the deviation from optimality. Moreover, from 
LliBS. we derive a near-optimal cluster scheduling approach (NCSAI. Wc present numencal 
results to illustrate the performance of the proposed approaches. The results show significant 
energy savings by AOCCS, and the near optimality of solutions enabled by LliBS and NCSA. 
Wc remark that, even though regular, hexagon-shaped cells are used for performance evaluation 
for the purpose of comparative study, our system model and the optimization approaches do 
not impose any topological assumption, and hence they ore generally applicable to any given 




cellular network layout. 

Ihe rest of the paper is organized as follows. Section 11 gives the system model. In Section 
111, we formulate CCS and prove its complexity. Section IV presents algorithm AGCCS. Section 
V details the LEBS scheme and NCSA. Numerical results arc given in Section VI. Section VII 
concludes the paper. 

Notations: We denote a (tall) vector by a bold lower case letter, say a. a matrix by a bold 
capital letter, say A. A set is denoted hv a letter in calligraphic style, say A. Notation < and 
< are lor componentwise inequalities between vectors. 

II. System Model 

A. Ct'llular Network w ith Cell Coupling 

Consider a downlink OFDMA based cellular network with / BSs serving J users. We use 
1 = and J ■ {1,..., J} to denote the sets of BSs and users, respectively. The set of 

users of BS t is denoted by Ji . and user sets of all BSs form a partitioning ol J. Let J% = \ Ji[* 
we have J% - J• Throughout the paper, we refer to BS i interchangeably with cell i. In 
OFDMA. the time-frequency domain resource is divided into resource units (RUs). A cell serves 
its users by orthogonal ti.e.. non-overlapping) use of the Rifs. We use dif to denote the traffic 
demand (in bits) of user j in cell i. As a QoS requirement, all users' demands have to be served 
within time T. 

In the load-coupling model, the SINK computation over one Rll uses the cell load levels to 
take into account inter-cell interference. In the following, we derive the SINK of one RU for user 
j of BS i. We denote by p% the transmission power per RU of cell i. and gt) the channel gain. 
The noise effect is denoted by 17 , which equals the power spectral density of white Gaussian 
noise times the bandwidth of a RU. For inter-cell interference from another BS k (k $4 <). we 
use pit and ghj to denote the corresponding transmission power and channel gain with respect to 
user j. Note that interference is zero if BS k is not utilizing any resource. Following fT5| - Usj| . 
we use the resource utilization level of BS k as a scaling factor in interference modeling. With 
the given notation and discussion, the SINR of user 3 in cell i is formulated below. 

_ Pt9ij _ 
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In (T). entity l * is referred to as cell load, and denotes the utilization level of RUs in cell 
k. that is. the proportion of RUs allocated for transmission. The load vector is denoted by 
I ■ pi t . * .. Jj] T * In IIHI . it is shown that utilizing resource fully, i c.. I ■ 1 is optimal 

from an energy standpoint. However, operating at full load means there is no spare OFDMA 
resource units. For the sake of generality, our system model is formulated for any preferred 
load level, with 0 < l < 1. Note that in iQ}. the product Pk9k)W represents the amount of the 
interference from cell k to user j. The interference is Gaussian distributed in the worst ease. 
Therefore, by using Gaussian code, the achievable rate, in bits per second, for user j on one RU 
with bandwidth B is computed as B log 2 (l -f SINR^). where B is the RU bandwidth. Therefore, 
to deliver a rate of ry to user j of cell i. 'H"logjlTVS 1 VR ~ > " KUs ^ required. Let W denote the total 
number of RUs per cell. 'Ihe corresponding load. i.e.. the proportion of the RU consumption of 
cell i due to serving user j, is thus l tJ m Observing that l t * ^ or cc ^ 

i gives the following equation. 
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Without loss of generality, for convenience we normalize such that WB * 1. From iQi. one 
can observe that the usent* rates cannot be set independently from each other. Moreover, to 
satisfy the QoS requirement, the rate values have to be chosen such that the demand is delivered 
within time T for all the users, that is. Tr tj > € Jo Vi € X. 


B. Multi-Cell Clustering 

In Section ITl-Al we have given the basic elements of the system model assuming that all cells 
arc in transmission mode. This may very well be feasible in meeting the QoS requirement, i.e.. 
one can find rates for © such that all demands arc delivered within time T. '11 k strategy, however, 
may not be energy-optimal. We now consider multi-cell clustering for energy optimization. 
A cluster refers to a subset of X. such that the BSs in the subset arc either all activated or 
all deactivated. For all possible 2* — 1 non-empty subsets of X. denote by *S the index set: 
5 » — 1}. Each index s € S maps tn a unique subset of BSs. Let X* denote the 

corresponding set of cells of element s € S. Scheduling cluster s means that all the BSs in set 
X* arc activated to be in transmission mode to serve their xssociated users, whereas all the BSs 
in arc deactivated. In the latter ease, the BS radio components arc turned off and no data 






can he transmitted. Ilierc is a transition time between activation and deactivation modes J8J. 
The transition time is however much smaller than the entire scheduling period Q* and hence 
we consider the transition time to be zero in this paper. 

Consider a cluster s with BS set Z*. The equation *0 lor cell i € Z* takes the following form. 
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Herr, represents the rate allocated to user j in cell i within cluster s. As /< represents a 
preferred resource utilization level of BS t. in this paper it is set independently of cell clustering. 
Note that in iQ'. the user rates arc the variables. By inspecting 0. we observe that it forms a 
linear equation system of the user rates. We introduce the following entity. 

1 


Then ls simplified to the equation below. 


,Vj G Ji.'/i G 2».V.s G S 
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For each cell i. its users arc served when cell i is active. Thus, as \vc assume there is at least 
one user per cell, every cell must be activated at least once. or. to be precise, every cell must 
be included in at least one cluster that has positive activation time. Note that a cell may be in 
multiple and active clusters. For these clusters, the achieved rates of the cell's users and the time 
durations of the clusters together determine the amount of served trafiic. which must meet the 
individual demand requirement within the spccitied time limit. 

We would like to point out that the system model focuses on downlink. To support the 
downlink, some control traihe is necessary in the uplink. This can be implemented by using 
time division duplex (TDD) or frequency division duplex (FDD), as defined in 3GPP* 


Remark: For any cell i € Zj. there arc infinitely many rate allocations satisfying 0. Thus one 
can choose to activate a cluster multiple times but with different rate allocations. In our system 
model, only one rate allocation is to be selected for each cluster. However, as will be clear later 
on. this seemingly strong restriction docs not impose any loss of generality. □ 






III. The Energy Minimization Problem 


A Problem Fommlurion 

Encrgy-cilicicnt CCS consisLs of determining the clusters that shall be activated and the 
respective activation durations, and the optimal user rate allocation within each cluster, such 
that the sum energy is minimum and the users* demand arc met within the time limit. For power 
consumption, we adopt a model that has been widely used (c.g.. (S|. 12H . 122| L The power of an 
active US i equals p = po + UWp%. The hrst component po is load-independent to account for 
the auxiliary power consumption due to processing circuits and cooling. The second component 
represents the transmission power with respect to the resource usage of US t. For an inactive US. 
the power consumption is considered negligibly small and assumed to be zero. Thus the power 
consumption of cluster s is p s ■ P? 0 *. In the following we formally define the vanablcs 

and formulate the CCS problem. 

In PI. the objective (unction i^a) expresses the sum energy, by taking the product of the sum 
power of each cluster and its scheduled time duration. The QoS constraints (oFl and i tTuT arc 
imposed to ensure that the required demand is delivered within the time limit. Note that i is not 
a running index in the lell-hand side of l ihbl . The use of i in the subscript of the summation 
is to exclude clusters that do not contain cell i. Here. ~: M means “such that** in an optimization 
problem formulation. Equations i |(kI) define the rate region. 


x M m The time duration of activating the USs in cluster 5. 
rjj m The rate allocated to user j of cell i € £«. s € «S. 

PI : min y^pjXj 

s.i. Y, VjeJ, ( v,e J 

tSMX. 
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Wc collect the user rate variables r\ } and their coefficients 6^ of cell t in cluster s as column 
vectors rf and respectively. Then i lfafl has the following compact iorm. 

Wfr; . V. € I., Vs e S (7) 

We note that (7) defines a simplex, which is a special type of J t -dimensional polytopc, as the 
rale region of users of cell i in cluster a. Any point of this polytopc represents an achievable 
rate vector, and vice versa. Wc use 72J to denote the simplex for cell i € I M in cluster s. 

Formulation PI is non-linear and non-convex. due to the product in ijnfi) From the discussion 
above, in general there arc lnlinitrlv many possible rate vectors. However, wc will show this 
non-linearity can be overcome without loss of optimality. 

Remark From the cell clustering problem is more general than BS partitioning. At optimum 
of CCS. a cell may be in multiple active clusters with different time durations. □ 


B. Linenr Formulation of CCS 

Our first result is provided in Lemma [T] and Theorem (3 The result enables PI to be trans¬ 
formed to a linear but equivalent form with a finite number of rate allocations. 

Lemma 1. Any solution to Problem PI can be equivalently represented using a finite number 
of rate vectors. 


Proof: Fur any cluster a and cell i € I*, the simplex, denoted hy 72J, is defined in ©. 
Without loss of generality, suppose the user indices of an arbitrary cell i € Z* is l r ..., J<. and 
I M = |2j|}. Simplex 72f has exactly J t vertices where r M t '* is the column 

vector having y#- as its jth element and /.cm for all the other J % 1 elements. Because 72* is a 
convex set. any vector r* € 72J can be represented as a convex combination of t r, J \ 

that is. there exist scalars Oj > 0, such that r* = 0irf J -f 0 2 r[' 2 *F ••• -F 


“d “ 1- 


Suppose cluster a is activated with time duration x% and rate vectors r*. i € Z». For cell 
i. the vector of the amount of served user demand is given hy multiplying scalar x* with 
the rate vector of this cell, i.c.. x*r*. By the observation above, r* ■ • Hence. 


T»rf rn 52 j€ji XsOjr*'* rn x i <?ii^-,0 l ...,0] r + + By this substitution. 


i 


■ 

r 4 rf is equivalently expressed by a weighted sum of rule vectors, each of which has one non-zero 
rate value. 

For cluster s, denote by r g the column vector obtained by stacking r{ r .* lc., r* ■ 

[(rf)^ t ...,(rj^|)^] y - Activating cluster s with time duration x$ (which is a scalar), the amount of 
served user demand of the cluster, in vector form, is x B r* * [i f (r*)*,... ,x # (r') T . - • •, **(rpr < |) T ] T . 
Applying the substitution step r* ■ 22 Jt r t observing that ^ t?j = 1. we obtain 

r.r* - 0i[r.(r,*) T , - - •, *.(r,' J ) r , - - -,*»{rj , Ii| ) T ] T + Oj, [x,(rf) T ,.... x,(r,*’ J, ) T , - - •, 

r *( r |Z J |)* 1* • I ben. repeating the substitution procedure for the other cells leads to the conclusion 
that the effect of activating cluster s with any rale vector r* can be equivalently achieved by 
combining at most IIki. Ji different rate vecturs, and the lemma follows. ■ 


Remark: Lemma Q] further sheds light on the remark of Section HII-AI Consider a solution in 
which a cluster is activated multiple times with different rate allocations. Because each of them 
is equivalent to a combination of the rate vectors from the same timte set. the activations can be 
aggregated into one activation, tor which the rate allocation is derived from the coefficients used 
in the combinations. Therefore considering one rate allocation per cluster in problem formulation 
PI does not cause any loss of generality. □ 





Let denote the set of vertices of Collecting one element of each v g . i € X M , leads to a 
column vector representing a rate allocation, in which exactly one of the users in every cell has 
positive rate. Lnumerating all such combinations amounts to taking the Cartesian product of sets 
v,. Vi € X a . This gives in total rale vectors, which we index by C g ■* {1, .. .,11^-^./*}. 

As an example, consider a cluster s of two cells Z t ■ {i t &} with three user* in each cell: 
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Ji ■ {1.2,3} and - {4,5.6}. The corresponding rate vector* in C M can be expressed by a 
4-by-KM matrix A*, where ^ ® and K*l “ 9. 

TTic vectors with index set C g . i.e.. the columns in A * for the example, are all feasible rate 
allocations tor cluster 5, satisfying Equation < EkTV Wc denote the rate allocated to user j in c € C t 
hy r£.j e 3. i € 2#t and c € C#. For each i (E Z*. their is one single user J € 3i for which 
rff m jj-. whereas the other users of the cell have zero rates. For example, in the first column 
( 0, 0. 0,0 ] r of A\ users I and 4 arc allocated positive rates rtf * p- and r# ■ 

cell I ^ cell 4 

in the two cells, respectively. 

Wc assign variable x*. for c € C § to indicate the activation time. Next, wc reformulate PI as 
a linear fnrmulatiun P2. in which Xj L - > 0 are variables, wheiras the rates air not. 


n 


Activation time of cluster s with rate index c € C ff . 



(9a) 

‘<s c«C. 


Y £ > d »yj e J„ v. e J 

(9b) 

•tS.lfl. t*C. 



<9c) 

•<5 etc. 


*«>o ,VceC„v a e5 

(9d) 


’llie constraints in P2 have the same meaning as the first two inequalities in PI. As P2 is 
irstneted to a given and finite set of rate vectors, the formulation is linear. 

Recall that in PI. user rate is an optimization variable, and. for each cell in a cluster, the 
users’ rales arc subject to i|hd) which dctincs the rate region that is a simplex. In P2. rjf is a 
not a variable. Specifically, rfj, j € form a vector coiTcsponding to a vertex of the simplex 
defined by i Rufl . Utilizing the fact that any point of a simplex can be equivalently represented 
by a convex combination of the vertices of the simplex <cf. Lemma Q}. in P2 the rate vectors 
representing the vertices are used instrad of »(53)i Hence the /-parameters and 6-parameter* do 
not appear explicitly in P2. Rather, they arc used in calculating the vertex vectors of the simplex. 

It is instinctive to illustrate Lemma 0 by an example. Consider a single cell i € Z* serv ing 
three users J x - (1,2,3}. Figure 0 provides an illustration of the rate region defined by 6^ rtf -f 



kfa r <3 + Wa r a * ft* This rate region corresponds to the surface of the triangle. The three vertices 
arc r< J,x — [j^,0, oj . r“ a * Jo. 0 . and r g ° ^ 0,0 r p-J . In PI. the rate vector r‘ has to 
he a point of the .simplex, that is. + 6J,rJ, + ■ l t . Setting 0 , ^ j - 1,2,3 gives 

0i 4* O 2 4 0j ■ 1 and r* ■ -f ftrf’ 2 4- ftr*' 3 . implying that rf is a convex combination of 
the three vertices, which arc used in P2. 


Theorem 2. PI and P2 are equivalent at optimum . 


Proof: From Lemma [I] any solution of PI can he equivalently stated by a combination of 
a finite set of rate vectors. In addition, from the construction of P2. the finite sets used in the 
proof of LemmaQ]arc exactly those in ©. It then follows immediately that any solution to PI 
has an equivalent solution in P2 Consider the opposite direction and take an arbitrary cluster 
s and its associated time durations x^, Vc € C g . in P^. For C t . denote by 
the corresponding rate vectors, all having length Ji- We define rate vector r* as follows, 

where x, m J2c(C. x «- 


r* ■ —r* 1 + —r* 2 + •. • + fl!£d r *l c -l < 10 ) 

i, x, x. 

By conslmclion in r* is a convex combination of r* 1 , r' 3 ,... , r^ c H Therefore for each 
cell i € its corresponding elements of r* is in R*. that is. r* is a feasible rate vector of cluster 
s in PI. Moreover, from i fTH . it is evident that activating cluster s with time duration x M and 
rate vector r* delivers exactly the same amount of demand as activating r jl , r Ji f .... r f ^ with 

durations _, T ^ al . respectively. Hence any solution of P2 has an equivalent solution 

in PI. and the theorem follows. ■ 
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C Problem Complexity 

Although P2 is linear, it is of exponential size in its complete form, because there air 2* — 1 
candidate clusters. However, in complexity theory, this fact, per se. docs not prove problem 
hardness, as a problem could be inappropriately stated in the formulation. Therefore, in this 
section we formally conclude and prove the hardness of CCS. 

Theorem 3. CCS is NP-hanL 


Proof: We give a polynomial-time reduction from the fractional chromatic number in graphs 
1231 . Consider a graph G with A r nodes. Denote by V(C) the set of all independent sets of C. 
and V(C t n) the set of independent sets containing vertex n. An independent set ts a set of non- 
adjaccnt nodes. Lc.. no pair of the nodes in the set ts connected by an edge. liach independent 
€ V(C) is associated with a non-negative variable x*. Finding the fractional chromatic 
number, which is NP-hard. amounts to (min Y2+ev{0.i\) x * — ljfi m 

The corresponding recognition version is to determine it there is a solution with 3Ci»cv(0 ) x * — K 
for a given number K . 

Consider the .special case of CCS with / = N BSs. each having a single user. Thus we can 
use US and user indices interchangeably. Let c denote a positive number with c < 2^ — 1. For 
any BS i € X. the parameters arc as follows: p t * 1. g n ■ r. I t =* 1, and d n m 1. Moreover. 
\V « 1. Po =■ 1. and Tf « e. l ; or any two USs i and k with t jL k . the channel gain ■ 1 if i 
and k arc adjacent in graph G. otherwise gtk m 0. The time limit T =* K. 

We prove that at optimum of the defined CCS instance, any two BSs connected by an edge 
in graph G will not be in the same cluster. Suppose the opposite, that is. at optimum there is 
some cluster s with time duration x$ > 0. and two USs i and k that arc adjacent vertices in 
G arc both present in I M . The cluster may contain additional BSs that arc adjacent to l or k. 
Consider the subgraph composed by the nodes in X t and edges between these nodes in graph 
G. Because i and k arc adjacent, there is a connected component in this subgraph containing i 
and k . possibly with additional BSs. Denote the nodes of this connected component by X B (i,k). 
Suppose we combine Z»\Z*(i,fc) with each individual BS in Z*(? t A:). Doing so gives |J»(i, k)\ 
cluster*, all with size \X M \Z J (i.A*)| -F 1. Consider activating these |Zj(i r /:)| new clusters, each 
with time duration ♦ |n place of cluster 5. For any BS in set X B \Z*(i t A*). the total time 
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of activation remains x*. and the rate equals that of the BS in I § . because by the definition of 
Z.(i, A:), there is no interference between the BSs in Z* \Zj(*.Ar) and those in Z»(i,A:). For any 
BS in Z t (i, fc). the rate is strictly smaller than in cluster s as I A (t,k) is a connected component 
in graph G. For *. for example, the rate is no more than log 2 (l + + ttt) < 

log 2 (l + 2* — 1) « Thus the demand delivered is less than fa. In the |Z v (i t A;)| new clusters 
defined above, the rate becomes I, and hence with activation time the demand delivered 

becomes which is higher than fa as |Z,(i. j)\ < N. Therefore, the amount of demand 

delivered via activating the \I t (i,k)\ clusters is no less than before. Consider the energy metric. 
For cluster s. the sum energy equals (1 + c)|Zj|x*. For each of the new clusters, the sum power 
is (1 -f c)(|Z» \Z*(i,fc)| -f 1). Because each is activated for time and there are |Z,(i, k)\ 

clusters, the sum energy equals (l4-c)(|Z J \Z i (s 9 fc)|+l)x # . This is smaller than the sum energy of 
clusters, because |Z*\Z*(? t A*)| < |Z # | — 2. Therefore, clusters cannot be optimal. In conclusion, 
at the optimum of the CCS instance, all clusters correspond to independent sets in graph G. 
As T m K, solving the CCS instance tor concluding its infeasibility) answers the recognition 
version of fractional chromatic number. As the latter is NP-compIctc. the theorem follows. ■ 

D. Two Simple IIS Scheduling Strategies 

1 "he previous analysis warrants the consideration of BS activation strategics that arc intention¬ 
ally simplified for tractahility. Here wc define Iwo simple schemes: l) individual activation of 
each BS; 2| simultaneous activation of all BSs. 

Definition 1. Using the notion uj lime Division Multiple Access tTDMAh o scheduling scheme 
is defined us "TDMA " if one US at u time is activated. 

The TDMA scheme reduces the number of possible clusters from 2* — 1 to /. i.e.. the total 
number of BSs. Utilizing Lemma QJ one observes that with TDMA. it is optimal to serve one 
user at a time, as formulated below. 

Lemma 4. For TDMA. then it is optimal for each IIS to sen e each of its users individually 
that is . TDMA at the BS level implies time-division access of the users of each BS as well. 

Proof: From the proof of Lemma [Q any achievable rate vector r^ MA of BS t under 
TDMA can be equivalently represented by a combination of serving one user m at a time 





Therefore. the TDMA scheme can he confined to deploying J % rate vectors, each having exactly 
one positive rate value for one user in J\. ■ 

From the lemma and iQ*- 4 !?). user j € is served with the maximum possible rate m 


(*k>ga(l 4* ^ 4 . Thus the time required for serving the user is t[j A ■ , |rg *‘ y<7 )- l hc 
optimality condition of TDMA is provided below. 


Theorem 5. TDMA is optima / for CCS if it is feasible . i.e., if t L° MA < ^ 


Proof: Suppose at optimum of PI. a cluster s of multiple BSs (i.e.. [Z # | > l) is activated 
with lime duration x,. and denote by r*. i € I* the rate vector allocated to BS i in the cluster. 
Consider replacing cluster s with |Z*| activations of the individual BSs in Z*. For any BS i € Z» 
with single-BS activation, the corresponding rate vector r t satislies r t t rf. because there ts no 
interference for single-BS activation and thus the b vector in iJTJi becomes smaller. Therefore, 
if each single BS of the cluster is activated with time Xj. Xjf< £ x^r*. lc., the demand that 
is served is no less than that of cluster 5 . Tlicrefore. to deliver the same amount of demand 
x $ r i to the users in any BS i € Z s . the time required by single-BS activation of ?. denoted by 
±4. satisfies xi < x§. 'Ihe energy consumed by cluster 3 equals ii tf*. With single-BS 
activations the energy consumption is improved to P? 1 **- The total time duration of the 

latter is JCiex, which however may be higher than Xj. Hence, as long as the time limit T is 
not exceeded, replacing cluster s with single-BS activations imprmcs energy, and the theorem 
follows. ■ 

By Theorem [3] TDMA is the preferred strategy for energy efficiency if the users* traffic 
demand is low such that it can be met by TDMA within the lime limit. Thus in this paper, we 
arc more interested in .scenarios of heavier traffic, for which TDMA is not time-feasible, i.e.. 

T.^ t T.a^ DUA >T- 

In addition to TDMA. we consider, as a simple and baseline scheme, the conventional strategy 
of having all BSs constantly activated. This scheme, as defined below, will be used as a 
benchmark tor performance comparison. 


Definition 2. A scheduling scheme is defined as "All-on" if all the BSs are constantly transmitting 
until all users' demands have been met. 


In All-on. one cluster d containing all the BSs is activated. Each BS serves its users with 




relatively lower rates due to the worst-ease interference. Denote by the transmission 

times required for meeting the individual user demands. 11k total activation time in All-on is 
a constant 7^.^ » max {ij t < 2 ** * • iO} which is the longest transmission time lor serving an 
individual user’s demand. If T > Tati-un. All-on is feasible and the sum energy is p*TmU-on* 
otherwise All-on is infeasible. Note that the rate vectors to be used are subject to optimization, 
and the algorithm in the neat section, i.c.. AlgorithmQ] can will used to obtain the optimal rates 
of * All-on'” for performance comparison. All-on in this paper is defined to be consistent with 
fTfo in order to enable a reasonable companson in Section [VT] 

IV. Optimization Algorithm for Cell Clustering and Scheduling 
A. Outline 

In this section, we propose and present an optimization algorithm for optimal cell clustering 
and scheduling (AOCCS). Consider formulation P2. It is in linear form, though the number of 
cluster* is exponential in network size. However, most of the clusters an: of no significance for 
constructing the optimal solution. In fact, as formalized below, one can conclude the existence 
of an optimal solution using at most J + 1 clusters. 

Lemma 6. For any feasible instance of CCS, there easts an optima! solution activating at most 
\J + 1| clusters . 

Proof: By theory of linear programming (LP) flffi . if an LP formulation is feasible and 
bounded, then at least one optimum is a so called basic solution. 11 k two conditions hold by 
the lemma's assumption and the stricture of P2. respectively. For any baste solution of P2 . 
the number of variables in the base matrix equals J *f 1. Le. t the number of* constraints. At an 
optimal basic solution, therefore, the number of x-vanables with positive values docs not exceed 
J + 1. and the lemma follows. ■ 

In view ol the size of P2 and Lemma 0 CCS should he solved in some other way than using 
P2 as is. Toward this end. we consider a column generation |[2T| approach lor solving CCS with 
guaranteed global optimality. The resulting algorithm AOCCS is based on a decomposition of 
P 2 into a master problem and a pricing problem. Tbe decomposition procedure keeps a small 
subset of candidate clusters in the master problem. The solution quality of the master problem 



is then successively improved by adding new clusters and rate vectors which arc generated from 
solving the pricing problem. 

ft The Master Problem 

'Hie so called master problem is a restricted term ol P2. A cluster along with an associated 
rale vector of each cell in the cluster arc jointly represented as a “column’ 4 . Adding a cluster 
and associated rates to the master problem is then equivalent to generating a new column in 
the coefficient matnx of PI. The master problem is presented below: the dilterence from P2 
is thjt the complete sets of clusters 5 and rate vectors C k arc replaced by subsets «S and C*. 
respectively, that arc successively augmented by new columns. 


P3: nun p.x*. 
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One iteration of AOCCS amounts to salving the master problem iJTT). and determining if 
augmenting HT} by a column li.c.. a cluster and an associated rate vector) that is not present 
in fTTI i can improve i ll lat . This is achieved by solving the pricing problem, to examine whether 
nr not there exists any new column with a negative reduced cost 1241 . 


C'. The Pricing Problem 

For the optimum of i fTT) . denote by and A* the dual variable values associated with 
constraints {TTF) and i ll let , respectively. From linear programming, the reduced cost of a given 
cluster & and a candidate rate vector c 6 C» is equal to p» — ““ Here. 

rJJ 1 is a not a variable. because it is associated with a given candidate rate vector c € C $ . Thus, 
finding the column with the minimum reduced cost can be performed for one cluster at a time. 
For each cluster 5 € £\<S. the task is to hnd the rate vector index c <= C» for which the reduced 
cost attains its minimum for the given cluster. Recall that the cardinality of C g is fl u * J i9 which 



can he vciy large. However, this task can he equivalently formulated hy the following linear 
optimization formulation. 


E E <A 

(12a) 

•«x. tt 3, 


Y'bM 

(I2h) 

IfJ. 


r y >0,Vj e J,,v* * €l. 

( 12 c) 


In formulation i fTIK rjj € J \,« € 2#. arc the optimization variables. Their values are chosen 
to minimize the objective function 1 1 5ab that represents reduced cost subject to dl 2 bMl 2 cl that 
detine the rate region. 


Remark: As PA is a linear program, the optimum is located at a vertex of the simplex defined 
hy UZhl . Thus the resulting rate vector indeed qualities for formulation P2. ic.. the rate vector, 
represented by optimization variables is one of the elements in Cm □ 

After solving fHl i for each cluster, if min^^ pg — ur* — A* < 0. then the corresponding 
cluster and its rate vector arc added as a new column to augment the master problem < 1171 If 
the minimum is non-negative, then the optimum of P3 with the current «S is also the global 
optimum for P2. The AOCCS operations are given in Algorithm [Q 


Algorithm 1 AM(< N - 

i: Construct P3 with an initial set of clusters 5 

i repeat 

3: Solve the master problem P3. 

4 fur s € S \ S do 

5: Solve the pricing problem PA 

& if p M — — A* < 0 then 

Add the corresponding cluster and rate vector to S and respectively 

* until p# — u;j — A* > 0 

— 

Remark: T he global optimality of Algorithm Q]docs not depend on the specific choice of the 
initial subset «S. For example. could have only one cluster containing all the cells. In Algorithm 





1. 5 and ihc rate vectors tor each s € 5 are successively augmented by new clusters and rale 
vectors, such that the objective function value of P'S becomes improved alter each augmentation. 
Identifying which cluster and rate vector to add is the task in the pricing problem. By linear 
programming theory solving the pricing problem will either lead In a cluster and rate vector 
lor augmenting PS. or conclude none of the remaining cluster* and rate vectors has negative 
reduced cost. In the latter case, global optimality is reached. □ 

The computational bottleneck of AOCCS is on the pricing problem PA. liven if fHI is linear, 
to ensure global optimality JHt needs to be solved for all cluster*, and the number of clusters is 
exponential in the network size. To this end. in the next section we develop an algorithm with 
a control parameter for the trade-off between complexity and optimality. 

Although Algorithm fTJ is presented for static problem input, the column generation approach 
has the potential of addressing system dynamics in respect of the numfvr of users and their 
demands. By column generation, the elements of cluster* and rate vectors are successively added. 
When there is an update in the input, say changed user demand, the algorithm simply starts from 
Stcp0 utilizing the current sets of clusters S and rate vectors C M , s £ «S. i.c.. a warm start, instead 
of optimizing by starting from scratch. If there is a new user, adding zero as the rate for this user 
in the current rate vectors, along with the current scheduling solution at hand <which satisfies 
the demands of all other users), together achieve the warm-start effect. 

V. Local Enumeration Based Bounding Scheme 

’Ihc challenge in dealing with the complexity of the pricing problem lies in the coupling 
relation between cells. Specifically, the interference and hence the rate region of one cell depend 
on the cluster composition, and the number of possible clusters is exponential in the number of 
BSs. 

We introduce a concept that we refer to as local enumeration. The notion is to confine, for each 
cell, the interference consideration to iLs local neighborhood. This is motivated by the fact that, 
for any BS. the interference experienced is dominated by the BSs nearby, whereas interference 
coming from more remote BSs is insignificant. For a cluster and any of its cells, inter-cell 
interference originates from all other cells in the cluster. Suppose we need to determine the cells 
to be grouped together to form a new' cluster in some optimization process (e.g. f solving the 


pricing problem in Section llV-Ct For each candidate cell, there arc 2 l ~ l possible interference 
scenarios, depending on which of the remaining / -1 cells are to be included in the same cluster, 
li we only account for which cells nearby arc included in the cluster in interference calculation, 
the numbrr of combinations of interference scenarios to be considered becomes much smaller. 
As will he clear later on. the si/e of the local neighborhood acts as a control parameter for the 
trade-off between the accuracy of interference estimation and complexity reduction. Moreover, 
the solution scheme via local enumeration allows to compute upper and lower bounds to the 
global optimum of CCS. as well as a near-optimal BS clustering and scheduling solution. 

A. Local Enumeration 

In local enumeration, the interference calculation of each cell is restricted to a selected set of 
cells that are nearby. For cell i. denote by M t the number of cells to be included in interference 
consideration, with 1 < M t < I — 1. The selection of the cells could be, for example, based 
on sorting the cells in I \ (i) using the average interference that each of them generates, if 
active, to the users in cell I. Denote by £, the resulting set of cells after the selection. Then, 
enumeration of the interference scenario lor cell i lakes place for the \1 i cells in £ 4 . That is. the 
enumeration applies to all possible combinations of active cells in £*. giving 2 Mi combinations 
in total, including the case where no cell in £, is selected. We denote £* as the collection of all 
combinations of £, where each combination is augmented with cell t. In other words, only the 
interference from the cells in £, arc exactly accounted for. Parameter Mt controls the size of 
enumeration. Note that if M i ml — 1. then all cells arc part of interference consideration and 
the scheme falls back to global enumeration. 



Fifun* 2. An Uluaraiua i>( to. j] L'lmmtrjtKin. 



An example is given in Figure Q Suppose = A/ s _ 3. meaning that interference from three 
cells will he considered for cell 1 and cell 5. respectively. The resulting cells tor interference 
consideration are L\ * (2,3,4} and £& « {6,7,8}. respectively. Local enumeration ol the cells 
in C t and £5 gives the combinations shown in Table [Q 

TjWc I 

Enumeration 01 £1 and £* for culls 1 and 5 in PiouruQ 

(lMl.2}.(L3( t (l.4). (1,2.3), (L2.4). (L 3.4).<1,2.3.4) 

£«: ( 5 ). (S.6). (5. 7 ), ( 5 , 41 ), ( 6 . 6 . 7 ). ( 6 . 6 . 5 ). ( 5 . 7 . 8 ). ( 6 , 6. 7 . 5 ) 


To avoid potential notational conflict, we denote by M t « {1, —, 2 ** 1 } the index set of £ 
and denote by the set of cells associated with c G Mt. For any c G Mi* the rale region of 
cell t is defined, such that only the activations of cells in N ti \ { 1 } arc accounted for exactly. 
To see the effect, consider as an example two clusters $1 and 53 . with Z #l ■ (1.3,4,5} and 
X %2 * (1,3,4,6,7}. For cell 1. in both eases the corresponding element of £g in local enumeration 
is (3,4). i.e.. the two significant interfere!* in both clusters. Therefore, from cell l*s viewpoint 
the cluster solutions at the network level have a many-to-onc mapping to the elements in Mti- 
leading to dramatically reduced complexity in comparison to enumerating all the 2 / - 1 rate 
regions. 

Recall that parameters bjj(j G *Xi,f G Z*, a G 5) are the coefficients in equation ijhd} of cell 1 
in cluster 5 . With local enumeration, the equation of a cell t is defined with respect to the cells 
in £*. lb avoid any ambiguity in notation, we use py to denote the corresponding parameter for 
user j of cell i. for the interference scenario c G Mi. 

We consider two options of treating the less significant interference from cells outside £*, Vi G 
X. corresponding to the best and worst possible interference scenarios, respectively. In the first 
option, interference from the BSs in l\(£U(t}) is considered zero, no matter of whether they 
arc in the same cluster as cell i or not. Hence the interference is considered for the cells in £, 
only, giving the following definition of the ^-parameter. 

1 



<13> 






In the worst-ease scenario, all BSs outside £, are considered being active concurrently, 
irrespective of the tme status. The resulting parameter definition is given below. 





(14) 


B. Hounding Scheme LESS 


Based on local enumeration, we develop a scheme LliBS to provide lower and upper hounds 
to the global optimum. In LliBS. column generation is applied using the same master problem 
as in Section IIV1 whereas the pacing problem is re-formulated by using local enumeration of 
interference scenarios. In PS. we present the variable definitions of the new formulation for 
pricing, and then the formulation itself. 

( 1 if cell i is selected for cluster formation. 

0 otherwise. 




1 if cluster formation corresponds to c € for cell *. i.c., the active cells in 
A U {«} arc A f a , 

0 otherwise. 

the rate of user j € 3% for c € 


P5: max 5Z X) ~ £ pi 0 ** < 15a) 

*«I c<M, jfj, ifl 

s. i. Y, Wi * Vc e M *< v * e 1 OSb) 

^ y« ■ z«, Vi e 2 (15c) 

€<M, 

Y Vhe A.Vi€i (isd> 

1- Y V«>4. VA€A,Vi€l (15c) 

rl >0,Vje Ji,VceM,v«€i (iso 

{0,i},VceM,v*ei (I5 g ) 

2, € (0,1),Vi el (I5h) 






Similar to Section |IV-C| the objective i ll 5a) is to minimize the reduced cost, or equivalently to 
maximize its negation. The second term in i ll5a) accounts for the total cluster power. For cell i. 
i 15b) defines the rate regions in the local enumeration ot* the interference scenarios, taking into 
account whether or not cell t is to he part of cluster formation. If cell t is selected to he active, 
then exactly one of the scenarios in cell i's local enumeration of interference has to hold true, 
otherwise none of the scenarios will apply. These effects air achieved by i ll3c) . The next tw r o 
sets of inequalities state the relation between clusterin'! at the network level and the resulting 
interference scenarios of local enumeration. Note that, each of the interference scenarios of a cell 
t implies w hich of the cells in C i are active, and vice versa. For example, interference scenario 
{1,2,4} of cell 1 in Figure 0 applies if and only if ccILs 2 and 4 are active (i.c„ part of the 
cluster formation) and cell 3 is inactive. In other words, there must be consistency between the 
2-variablcs and y-varioblcs. This consistency is achieved by 115d| »-« ll5c| » By i !15d| i. for any cell 
i and another cell h that is subject to interference consideration, the latter mast be active (i.e„ 
zs * 1 ) if any of the y-variables corresponding to interference scenarios containing h is set to 
one. Consider again the aforementioned example. If the interference scenario {1,2} is selected 
for cell 1. then Z2 must be one. Inequalities i ll5c) deliver a similar effect lor the opposite case, 
namely the choice of interference sccnano of cell i also dictates the cells that mast be inactive 
in Ci. 

From a .scalability point of view', the strength of P5 is that the interference enumeration is 
limited to the cells in .M*. of which the size is 2* v< — 1 for each ifl. This is in contrast to the 
pricing problem in Section UV-l 1 for which the number of candidate clusters is 2 1 1. As 

contains neighboring BSs with significant interferences only, typically M% / without much 
loss of accuracy. Moreover. Afj can be used as a control parameter for the trade-off between 
accuracy and computation. 

Remark: At the optimum of P5. the cluster solution is given by cells for which z* ■ 1,V* € X. 
For each of such cells, there is an optimal rate vector corresponding to a vertex of the simplex 
defined by i 1 5b) . because the objective function is linear in rate. Thus the cluster and the rate 
vector obtained from solving Ph arc similar to the columas in P2. in the sense that for any cell 
in the clastcr, exactly one user will attain a positive rate, and the other users have zero rates. □ 

In solving i ff?) , the parameters ^ (j € Ji £ Ij,e € Mi) arc set to 0^ nr ^ in fH} and 







fTT| i. corresponding to treating the BSs outside the local enumeration <LE| scope C t to he all 
non-active and all active, respectively. We use “LE-ofP and ~LE-on” to respectively refer to 
the two settings. 'ITicse settings, when embedded into the column generation algorithm AOCCS. 
yield lower and upper bounds confining the global optimum. This result is formalized below. 


Theorem 7. Denote by £T the global optimum of CCS, and E*£ M q and the optimal values 
from embedding LE-vff and LE-on into column generation, respectively . Then E* is: ^ < E* < 




Proof: Denote by C(1 the set of clusters in the optimal solution from the LE-on scheme. 
For any cluster .s € the interference scenario in the local enumeration for cell i € 2*. 

induced by s. is the index element c € such that Af# m (£ 4 U {t}) C\2 g . Denote by e^s) 
the index of this interference scenario. From the remark above the theorem, for each s € 5£ K4Ifc 
and cell i € 2*. exactly one user of i. say has positive rate ■ $*!<•) • whereas all other 
users of J x carry zero rates. 

Consider replacing the rate of p by r[y * while keeping the zero rates of the other users 
nf cell i. By definition, /£,(.), C I, in Lli-on. Therefore E«(a/.„. i .\(*)1o(1\(C.u( 0))P*^'* 2 
Pron» €Q and Q3). V tj . < and thus rj,. > rJj. W . Performing this rate 
update for all cells in I§. we obtain a column c £ C* in P2 for cluster s . with a rate vector 
such that the values arc at least as high as those in the rale vector in the solution of LE-on with 
ihc same cluster, and the non-zero elements coincide in their positions. Thus for the same time 
duration of each s € denying the corresponding columns of P2 gives a feasible, though 

not necessarily optimal, solution of P2. Hence E * < Ef li iU . 

For the second inequality, the idea of the proof is analogous, though the starting point is the 
globally optimal set of clusters of P2 . I hc proof consists in observing that each cluster and its 
associated rate vector correspond to a solution that is potentially relumed by solving P5. but 
with the same or higher rate values: the latter is because for any cluster 5. i € I*, interference 
scenario Ci(s), and j € *7i- we have 3*^ < &?j- By the theory of column generation in linear 
programming E3| . < bjj implies that the optimal value from LE-off will not undcr-pcrform 

£\ hence E* < E\ ■ 


C Near-Optimal Solution Based an LEBS 

LEBS not only provides bounds In the global optimum, but also enables the computation of 
a feasible solution of CCS. From the proof of Theorem 0 for LE-an, storting from and 

the rate allocation for each s £ and replacing each positive rate value with that derived 

from leads to a feasible solution of P2. Note that the cardinality of «S£j^„ is at most J + 1. 
thus computing this feasible solution comes with little additional effort. The idea leads to the 
following near-optimal cluster scheduling approach (NCSA). 

!) S 4- S^ £aio . 

2) If > 0. r* f 4- otherwise r* 4-0. V) € J„Vt € Z.,Vs € S. 

3) Solve Pi to optimality. 

Note that the rate values used in LE-on are pessimistic, ic.. they air equal to or lower than 
the values derived from ©. Thus the total energy given by NCSA, denoted by Escsa • improves 
that of LE-on. giving the corollary below. 


Corollary 8. E? < EX CS . < E:„ m 


Note that a feasible solution may be derived from LE-olf as well. However, since in LE-off 
the rate values arc on the optimistic side, there is no guarantee that the scheduling time limit T 
can be irspcctcd after replacing the rate values with those obtained from accurate interference 
calculation. 


VI. Performance Evaluation 

A Experimental Setup 

Two networks consisting of seven and nineteen cells, respectively, have been used in the 
simulations, sec Figure 0 Each BS serves live randomly and uniformly distributed users within 
the cell's area. The networks operate at 2 GHz. Following the LTE standards, we use one resource 
block to represent a resource unit with ISO kll/. bandwidth in the simulation. The total bandwidth 
amounts to 4.5 MHz. The channel gain consists of path loss and shadowing fading. The path 
loss follows the COST-231-HATA model. For shadowing, the log-normal distribution with 8 dB 
standard deviation is used. l ; or each network, we generate one hundred instances and consider 
the average pcrtormancc. Motivated by the results in fl?| . (T8| . we set cell's load / ■ 1. In 


Algorithm |TJ is initially set to contain all clusters of size two. with C M * C\ tor each s € S. 
Table [0 summarizes the key simulation parameters. 




Figure 3. Network* few pertamiarve evdodiiKk 
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Parameter 

Value 

Cell radius 
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2 GHz 

Ti*al huduidth per cell 

4.5 MHz 
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IBO kHz 

Nurater oi mo* per cell 

5 

L l »er cfcnurri d t; 

2 Mbits 

Path loss 

COST-13 MIATA 

Shiikmmp 

Lugncnm! S JH doutuxi 

Transmit pwvvrr per Rl 1 

1 W 

Circuit power pj fvf US 

5 W 

Noise power spectral density 

-174 dBin/Hz 

Lind per BS 

1.0 


Among the algorithms. AOCCS guarantees global optimality {see also the remark in Sec¬ 
tion II V-CI . however it is not intended for large networks. Algorithm NCSA is a sub-optimal 
algorithm providing a heuristic solution, by means of local enumeration by which the pricing 
problem is of polynomial si/r. The purpose of LliHS Ls to deliver hounds on global optimum 
{which is hard to compute for large networks>. and thereby enable to evaluate NCSA in terms 
of the deviation from global optimality. In the following, we present and compare the results of 
these algorithms. 




ft Energy Optimization by AOCCS niul NCSA 


To evaluate the performance of the proposed AOCCS and NCSA, the conventional scheme 
“All-on” (see Section EEd and a scheme called “BS Switch-off Pattern Strategy (BSPS)" 
proposed in ffOl . have been implemented for comparison. For BSPS, five activation patterns, 
referred to as All-on. I. II. III. IV. are proposed in 1101 . The first pattern coincides with our 
“All-on" scheme defined in Section llll-Dl The other four patterns arc composed by cell subsets 
with decreasing cardinality. In Qfl|. one of the patterns is chosen at a time based on the level of 
user demand. We remark that inter-eel! interference is not considered for analytical simplicity 
t° HOI For our simulation, however, we account for inter-cell interference in the comparison. 
For the comparative study, we consider the best achievable performance of BSPS, by allowing 
mixed and optimized use of its patterns. This is earned out by generating cell clusters based 
on the patterns in IlIQI . followed by solving the resulting optimization formulation © to global 
optimality. 

Table 111 
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T-2 
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AOCCS 
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:. •• ? 

BSPS in Pffl 
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HM7I 
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V U*oii 

105.15 
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We examine the sum energy for various values of the delay limit T . The results arc summarized 
in Table [ITT] For the NCSA results in the table. M % equals 5 and 7, respectively, for the 7-cell 
and I9-ccll networks. Note that the table docs not include the results of A(X'CS for the 19-cell 
network, because the global optimum for this network si/e is beyond the reach of AOCCS. The 
TDMA scheme (see Section Ull-bt is not included since TDMA is infeasible for the delay limits 
used in Table [TIT] 




















Wc make the following observations from the results in Table [ITT] First, except for All-on 
that is insensitive to T by design, higher QoS requirement (Le., smaller T ) requires higher sum 
energy. The amount of energy difference is, however, relatively small for the largest and smallest 
values of T. Thus having a larger time limit, or. equivalently, lower QoS requirement, does not 
give significant reduction of energy consumption. From the results, energy saving comes mainly 
from optimizing cell cluster formation and activation time duration. 

AOCCS leads to the global optimum and hence the minimum sum energy, whereas All-on 
requires the highest energy consumption by its nature, as can been seen in the table. Among 
the sub-optimal schemes. NCSA yields the best performance. Indeed, for the 7-cdl network 
NCSA consistently achieves less than 1% deviation from global optimality. The BSPS scheme 
performs rather close to global optimality for the 7-ccll network. For the network with larger 
size, however. NCSA leads to significantly better results. 

C Solution Characteristicx 

To gam further insights, wc consider the average number of activations of the cells and the 
average data rate of the users in TOMA. All-on. and the optimal schedules for T ■ 1 and T ■ 4. 
The results arc displayed in Figure 0 for the 7-ccll network. 



Figure 4. The awnigt aumlvi lit' veil iUnjuonv jiuI ntr xt tipiuiium. 

For TDMA. every cell is activated as many times as the number of users in the cell in Figure 0] 
This observation verifies Lemma 0 that is. TOMA at the BS level also implies time-division 
access of its users. Because the users are served one at a time in TDMA, the rate is the highest 















possible, as can be seen from Figure 0b). By Theorem 0 one would expect that, when the 
time limit of serving the user demand becomes more restrictive, the optimal schedule has to use 
cluster* of larger size, and consequently it is more likely that a BS will appear in multiple clusters 
for activation. This is confirmed by comparing the results fur T ■ 1 and T = 4 m Figure [0a). 
Note that, although the average user rate is lower for small T in Figure 0b). the demand can 
still be served in shorter time because of the increased number of activations. For All-on. there 
is no interruption in transmission, though the user rate is lowest due tn inter-cell interference 
among all the BSs. We note that for All-on. the optimal schedule uses only one cluster of all 
the BSs. but the cluster is activated with multiple rate vectors with optimized activation time 
durations. From Figure 0a), the number of rate vectors used is less than J -f 1 ■ 36. which is 
consistent with larmma 0 

D. Performance of LEBS in Hounding Optimum 

We examine the accuracy of the estimation of glohal optimum via LEBS. and set this in 
perspective to AOCCS and NCSA. The results, given as sum energy versus delay limit T. arc 
shown in Figures[5]and[5] For a comprehensive performance picture, Af, is successively increased 
in the two figures. For each value of Af*. a pair of markers is used to show the upper and lower 
bounds of the global optimum. The gaps between the upper and lower bounds from LEBS. 
averaged over T lor selected values of A/,, arc further detailed in Table [TV] In addition to setting 
Mt uniformly for all cells. Table [IV] also contains results of setting Af* to be the number of cell 
i’s one-hop neighbor cells. l ; or example, in the 7-ccll network. Af* ■ 6 for the center cell and 
Af, » 3 for the other cells. The results obtained with this setting is referred to os “Neighbor-Af,* 4 . 
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From the two figures and Table llVI augmenting the si/c of local enumeration of interference 
lie., parameter A/*) leads to progressively tighter hounding intervals. Note that, even with A/, 
being as small as one. that is. only a single neighboring BS is accounted for. the accuracy 
remains satisfactory - the relative difference of the upper and lower bounds of global optimum 
is less Uian 3% and 12%. respectively, for the two networks. We observe that when T increases, 
the lower bound from LE-off tends to improve in relation to AOCCS or NCSA, whereas the 
upper bound from LE-on does not. This is because LE-on over-estimates interference, and for 
large T the error grows because optimal clusters tend to be small (cf. Theorem 0. For LE-ofV, 
increasing T has the reverse effect. 
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NCSA combines LE-on with post-processing. From Figure 0 NCSA performs extremely close 
to global optimum for the 7-cell network - the relative deviation is merely 0.7% or less. For 
the 19-eel! network, global optimum is not available for evaluating NCSA. However, the lower 
bound of global optimum, derived from LE-oH*. reveals that the deviation from global optimum 
is within 1%. This demonstrates the performance of NCSA as well as the usefulness ol the 
bounding scheme. Moreover, from the last row of Table [TV] setting M t based on the number of 
one-hop neighbor significantly outperforms uniformly setting A/, ■ 5. while the problem si/e.s 
in LEBS arc comparable for the two settings. The cell-adaptive choice of Mi achieves similar 
performance as setting i\f 4 * 7. However, the problem size is considerably smaller in the former 
because A/< < 7 for most ccILs. 











VII. Conclusions 


Wc have considered optimal base station clustering and scheduling with the objective of 
minimizing energy consumption. Theoretical insights and mathematical formulations have been 
provided. For problem solution, wc have presented a column generation approach, as well as 
a local enumeration scheme. The latter effectively addresses the difficulty of optimal cluster 
formation that is of combinatorial nature. Integrating column generation with local enumeration 
not only leads to flexibility in balancing optimality with scalability, but also yields lower and 
upper bounds confining the global optimum. Numerical results demonstrate that the algorithmic 
notions result in significant improvement in energy saving in comparison to existing schemes. 
In addition, the BS clustering and scheduling solutions that have been obtained arc very close 
to global optimum. 

The wort: in this paper provides a theoretical framework of optimizing BS clustering and 
activation. The proposed framework can be potentially implemented using the almost blank 
subframes (ABS) scheme defined in 3GFP Release 10. The BSs during their deactivation time 
durations can be set to the ABS mode, in which only control channels can be used with very 
low power, whereas the active BSs are in norma! transmission mode. Also, from a scalability 
standpoint, the use of NCSA with local enumeration of interference has two implications. First, 
the problem size grows only linearly instead of exponentially in the number of BSs. Second, 
performance calculation for each BS needs to consider the neighboring BSs only. As such, the 
signaling cost for implementing the framework is reasonable. 

An extension of the current work is to investigate the potential of power control. Base station 
clustering with cooperative multi-point transmission is another topic for future studies. 
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